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ABSTRACT 

Weak lensing experiments are a powerful probe of cosmology through their measure- 
ment of the mass distribution of the universe. A challenge for this technique is to 
control systematic errors that occur when measuring the shapes of distant galaxies. In 
this paper we investigate noise bias, a systematic error that arises from second order 
noise terms in the shape measurement process. We first derive analytical expressions 
for the bias of general Maximum Likelihood estimators (MLEs) in the presence of 
additive noise. We then find analytical expressions for a simplified toy model in which 
galaxies are modeled and fitted with a Gaussian with its size as a single free param- 
eter. Even for this very simple case we find a significant effect. We also extend our 
analysis to a more realistic 6-parameter elliptical Gaussian model. We find that the 
noise bias is generically of the order of the inverse-squared signal-to-noise ratio (SNR) 
of the galaxies and is thus of the order of a percent for galaxies of SNR of 10, i.e. 
comparable to the weak lensing shear signal. This is nearly two orders of magnitude 
greater than the systematics requirements for future all-sky weak lensing surveys. We 
discuss possible ways to circumvent this effect, including a calibration method using 
simulations discussed in an associated paper. 

Key words: methods: statistical - techniques: image processing - cosmology: obser- 
vations - gravitational lensing: weak - dark matter - dark energy 



1 INTRODUCTION 



Weak gravitational lensing is a technique to map the dis- 
tribution of dark matter in the universe (see e.g. Refregier 
Hoekstra fc Jain||2008 for reviews). It relies on mea- 
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surement of the apparent shapes of distant galaxies that 
are distorted due to matter inhomogeneities along the line 
of sight. Weak lensing offers great prospects for the mea- 



surement of cosmological parameters ( Peacock & Schneider 
|2006 Albrecht et al.|2006 l. In particular, the measurements 
of dark energy parameters with future wide field surveys is 
very promising but places strong requirements on weak lens- 
ing measurements and in particular in the control of system- 
atics. 

The main potential systematic effects are generally con- 
sidered to be: (i) galaxy shape measurement from galaxy 
images; (ii) galaxy distance measurement using photometric 
redshifts; (iii) galaxy intrinsic alignments arising from the 
galaxy formation process; (iv) accuracy of theoretical pre- 
dictions of dark matter clustering. We focus on the first of 
these in this paper. 

In most cases the gravitational lensing effect produces 



a matrix distortion stretching of the galaxy image. This im- 
age shear must be uncovered in the presence of nuisance 
observational effects, including: image blurring due to the 
atmosphere and telescope optics; image pixelisation due to 
the nature of photon detectors; and noise due to the finite 
number of photons from the galaxy and other backgrounds. 
Furthermore, the intrinsic properties of the galaxy prior to 
lensing distortion are unknown. 

The first detection of this shearing effect was made 
by Tyson et al.|(1990) and repeated by Bonnet et al. (19941 



who also developed methods for removing the image con- 
volution effects. This was taken to a new level by |Kaiser| 



et al. ( 1995 1 in a method that is widely referred to as KSB 



and that has remained the most widely used shear mea- 
surement method to this day. Essentially the KSB method 
uses weighted quadrupole moments of images to calculate 
shears, and corrects the shears for the weighting function. 
This was further improved in ( |Kaiser|2000[ ). An alternative 
approach using a simple galaxy model to forward fit the data 



was proposed in jKuijken (1999J) and implemented in Bridle 
|et al.| ( p502| and [Miller et al.| i 2007| ). More fiexible models 
using Gauss-Laguerre polynomials, or shapelets, have also 
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been proposed (Bernstein & Jarvis 2002 Refregier & Ba- 
con|2003l. Each of these approaches has potential strengths 



and drawbacks. For instance the hmitations of model-fitting 
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methods were explored in Melchior et al. (20101 and Voigt 



& Bridle (20101, and potentially mitigated by Bernstein 



There have been several simulation challenges to assess 
how well current methods can measure gravitational shear 
and to encourage development of new methods. The Shear 
TEsting Program (STEP) 1 Challenge provided a suite of 
simulated images using relatively simple galaxy models but a 
realistic image blurring model. The galaxies were distributed 
with random positions across the image and the same shear 
was used to distort every galaxy in a given large image. It 
was found that the existing methods that had already been 
applied to observational data were sufficiently good to merit 
the science results on that data ( Heymans et al.|2006 l. 

The STEP2 Challenge used more realistic galaxy mod- 
els and a wider range of blurring models ( Massey et al. [20(37 1 
and reached similar conclusions despite this additional com- 
plexity. However, neither challenge was sufficiently large to 
forecast the efficacy of existing methods for use on future 
surveys, and neither challenge was able to address all po- 
tential sources of measurement bias in real data (such as 
uncertainty about the image Point Spread Function or PSF; 
see, e.g., |Paulin-Henriksson et al.|2008[[2009}|RowepoTol ). In 
addition, while it was possible in many cases to positively de- 
tect biases in weak lensing shape measurement methods, due 
to the complexity and realism of the STEP challenges it was 
not always possible to attribute definite causes for these ef- 
fects. In the complex, multi-stage analysis required for weak 
lensing measurement it can be very difficult to isolate indi- 
vidual causes of systematic bias, and yet diagnosing these 
individual contributions is an important ongoing process in 
the development of accurate measurement methodology. 

The GRavitational lEnsing Accuracy Testing 2008 
(GREAT08) Challenge was much simpler: it reverted to sim- 
pler galaxy models; avoided overlapping galaxies; and used 
similar properties for all galaxies in a large image ( |Bridle| 
et al.|[2008l. It was designed to attract new methods from 



outside the weak lensing community, in particular from com- 
puter scientists. Most significantly, the number of galaxies 
was chosen to test methods at the level required for sur- 
veys in the foreseeable future, as calculated in |Amara fc] 
[Refregier] (|2007|). A new approach won the competition, in- 
spired by'Kuijken ( 19991, which took advantage of the fact 



that the same shear was used for many galaxies at a time, by 
'stacking' the galaxies ( |Lewis|2009l|Hosseini fc Bethge|2009[ ). 
Although progress has been substantial, questions still re- 
main about the likely issues that need to be overcome to 
reach the precision needed for future all-sky surveys. 

In this paper we study noise bias, one of the system- 
atic effects that can affect weak lensing measurements. It 
arises from high-order noise terms in the measurement of 
the shape parameters of galaxies, increasing in magnitude 
at low galaxy signal-to-noise ratio (SNR). Its effects on sec- 
ond order moment measurements from convolved Gaussian 
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galaxy images has been described by Hirata et al. 

To study this effect in a forward-fitting weak lensing 
measurement context we first derive general expressions for 
the variance and bias of Maximum-Likelihood Estimators 
(MLE) of model parameters in the presence of additive 



Gaussian noise (g2|). We then apply it to a one-parameter 
toy model consisting of the Maximum-Likelihood (ML) fit- 
ting of the size of a Gaussian galaxy model to a Gaussian 
galaxy convolved with a known Gaussian PSF (§3|. While 
this model is clearly oversimplified it illustrates the principle 
of noise bias and its amplitude. We then extend this result 
by considering a more realistic model consisting of an ellip- 
tical Gaussian galaxy with 6 free parameters (Q. In sJs] we 
discuss the consequences of our findings and possible reme- 
dies, and summarise our conclusions, including a calibration 
method using simulations discussed in an associated paper 
(Kacprzak et al. 2012, hereafter K12). 



2 GENERAL 2D SHAPE ESTIMATION 

In this section, we study the general problem of the estima- 
tion of the shape parameters of a 2D object in the presence 
of additive, uncorrelated Gaussian noise. For weak lensing 
these results are applicable to the measurement of galaxy 
shapes, and to the estimation of the instrument PSF using 
stars in the image. The general analytical results that we 
derive will serve as a useful base for comparison with the 
more realistic conditions studied by K12 using numerical 
simulations. 



2.1 General results 

Let us thus consider the observed 2D surface brightness 
/obs(x) of an object that is described by a model /(x;a), 
where x is the position on the image and a is the vector of 
parameters describing the shape of the object. We can write 
the observed surface brightness as 



/obs(x) = /(x;a') +n(x) 



(1) 



where a* are the true shape parameters of the object and 
n(x) is the noise which is assumed to be uncorrelated and 
Gaussian with {n(x)) — and (n(x)^) = aj^. 

With these assumptions the log likelihood of the data 
given the model is Ini = — x^/2, where the usual x^- 
functional is given by 



X^(a) ^^f^n 



[/obs(xp) -/(xp;a)]^ 



(2) 



where the sum is over all pixels p in the image. 

The MLE a for the shape parameters of the object can 
then be constructed by requiring that x^ is minimised at 



a = a. MLEs were first studied by Fisher ( 1922 I, then later 
by |Rao| ( [19731 ) and [Cramlr] ( |1999[ |. They are commonly used 
estimators in statistics and have several desirable properties, 
including consistency, which requires that in the limit of high 
SNR the MLE recovers the true values a' of the estimated 
parameters. 

In Appendix[Alwe derive general properties for the MLE 
a using an expansion in the inverse SNR of the object. There 
and in what follows we label SNR using the parameter p. We 
first show that the covariance of the estimated parameters 
is, to leading order, given by 



cov[aj,aj] = (F ^)ij + 0{p ■*), 
where the Fisher matrix is given by 



(3) 
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■"». =5]'^' 



_2 a/(xp;a*)a/(xp;a') 
dui daj 



(4) 



We also find that the bias in the parameters b[ai\ — 
{Hi) — af^" is given by 

b[a.] = -l{F-%,{F~%i.B,,i+0{p-''), (5) 

where the summation convention over repeated indices is 
assumed and the bias tensor is given by 

,a/(xp;a')aV(xp;a*) 



Bijk ~ y cr„ 



dai daj dak 



(6) 



approximation is given by Equation |A8[ re-expressing the 
sum as an integral over the 2D image. 

Seemingly counter-intuitively, the bias of the derived 
parameters <7i(a) is not equal in general to the bias that 
would be derived had it instead been chosen to find the MLE 
of the parameters gi directly. This can be understood from 
examining the covariance transformation rule described in 
Equation [T] Thus, the exact value of the bias for any pa- 
rameter of interest may depend on the parametrisation of 
the model itself. We will show an example of this property 
in the following simplified example. 



It is often useful to consider functions gi (a) of the pa- 
rameters. The covariances of these functions are given by 



. , dgi dgj . , 

cov[sf,,5j] = -— -— cov[afe,a;], 
Oat^ oai 

while their bias is given by 



(7) 



(8) 



3 CIRCULAR GAUSSIAN MODEL 

To illustrate the above results, we first consider the case 
of the measurement of the size of a 2D, circular, Gaussian 
galaxy without any other free parameters. This is a highly 
simplified illustration of the shape measurement problem 
and should therefore be considered as a toy model that cap- 
tures the main features of the effect of noise bias. 



2.2 Properties 

As can be seen from Equation [5] the bias tensor depends 
on second order derivatives of the model /(x, a) in the pa- 
rameters a and therefore vanishes for linear models. This re- 
states the known fact that MLEs may be biased in general, 
except in the case of linear models. As noted in Appendix [A} 
the present bias arises from second order noise terms and is 
therefore referred to as 'noise bias'. 

We then note that the squared error in the parameters 



cov ai, ai 



and the bias b[ai] are of order 



b[ai]/al ~ [a[a,]/alf 



(9) 



in dimensionless units. In the limit of high SNR, p — >■ oo, 
both tend to zero so that the estimator tends to the true 
value a — >■ a*, thus recovering the consistency property of 



MLEs (e.g. |Cramer||1999 K 

For finite values of p the statistical error and bias of 
the parameters can be non negligible. Weak lensing shape 
measurements are typically performed down to p ~ 10 to 
maximise the surface density of galaxies. In this case, the 
statistical RMS error will be of order p~^ ~ 0.1 which is 
consistent with the typical observed shape noise per galaxy 
of about 5y ~ 0.3, and which includes this statistical mea- 
surement error and the distribution of the intrinsic shape of 
galaxies. The bias in the parameters in this regime will be 
of order p~^ ~ 0.01 which is comparable to the weak lens- 
ing shear signal 7 ~ 0.02 and may contribute to explain why 



some methods do not perform better (e.g. Bridle et al. 2008 1. 
As shown by Amara & Refregier (2007), the requirement for 
the variance of the shear systematics is a^ys ~ lO"'^ which 
corresponds to the systematic shear error of ^7 ~ 3 x 10~*. 
This is almost two orders of magnitude smaller than pre- 
dicted by the current analysis of noise bias. 

We also note that the expressions for the variance and 
bias of the ML estimators are expressed in Equations [3] and 
[5] in terms of a sum over pixel positions, but can often be 
more conveniently evaluated in the continuum limit where 
the pixel size is small compared to the object size. This 



3.1 Case without PSF convolution 

First, let us consider the case where the galaxy is not con- 
volved with the PSF of the instrument. In this case the 
galaxy surface brightness is given by 



/(x;a) = /oexp 



r 
'2^ 



(10) 



where r^ = x\ -\- x^, Jq \s, & & normalization controlling the 
flux of the galaxy, and a is the rms size. 

To characterise the SNR p of the galaxy, we temporar- 
ily consider /o as the free parameter while keeping a fixed. 
Using the continuum limit (Eq. |A8[ ), we can analytically in- 
tegrate Equation |3] for the variance of the estimator for /o 
and obtain 



P[/o] 



/o 



VT^fpa 

(Jnh 



(11) 



where h is the pixel scale and a„ is the noise rms as in ip] 
Here and in the following, we drop the " and * symbols 
to simplify the notation when it does not lead to ambigu- 
ities. This definition of the SNR can be considered as the 
ideal detection signal-to-noise of the galaxy, corresponding 
to a perfect knowledge of the galaxy shape and position, or 
equivalently to an ideal matched filter. 

Now, considering a as the only free parameter (and thus 
leaving /o fixed) we again integrate Equation ^ analytically 
in the continuum limit and obtain 



1 



= ^p[for' + o{p- 



(12) 



Integrating Equation^ 



which scales as p~^ as noted in [ 2.2 
we find that the bias in this case vanishes at second order, 
i.e. b[a] = + O{p~'^). This is due to a cancelation which we 
find occurs for any 2D circular galaxy model which can be 
written as 



/(x;a) = fo4>(r/a 



(13) 



where <jf> is any function describing the galaxy profile. In- 
terestingly, this cancelation only occurs in two dimensions. 
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and the second order bias term does not vanish in 1 or > 2 
dimensions even if the above scahng symmetry holds. 



3.2 Case with PSF convolution 

Let us now consider the case of interest in practice where 
the circular Gaussian galaxy is convolved with a PSF due 
to the instrument and the atmosphere. In the spirit of the 
toy model, we make the simplifying assumption that the 
PSF is itself circular and Gaussian with an rms size p. Since 
the convolution of two Gaussians is another Gaussian with 
standard deviations adding in quadrature, the model in this 



/(x;a) = /oexp 



2(a2+p2) 



(14) 



where /o is a normalization controlling the flux of the galaxy. 
The PSF size p is assumed known and the noise assumed to 
be Gaussian with an rms of (t„ . 

In this case, the ideal SNR defined as in SsTTl becomes 



/o V^fo^ja^+P'^ 



a"[/o] Onh 

and the uncertainty in the MLE for a is given by 



cr[a] _ 1 
a ^ ^/2 



1 + 



(9' 



P[/o] 



o{p- 



(15) 



(16) 



Because the presence of the PSF breaks the scaling symme- 
try of Equation [l3J the bias does not vanish to second order 
and we find 



M 

a 



1 + 



©1 ©•*" 



~' + 0{p'') 



(17) 



We note that we recover the results of §3. 1| without a PSF 
when we set p = in the expressions above. We also see 
that the scalings of Equation [9] hold for the rms error and 
bicis with pre-factors that depend on the ratio of the galaxy 
to PSF size. 

We also notice that if we had instead estimated the con- 
volved galaxy parameters directly, and obtained the decon- 
volved size as a derived parameter (using Equation [8|, the 
bias would then have vanished to this order. But this would 
also allow unphysical values of the parameters, with a? < 0. 
This is an illustration of the fact that the bias of physical pa- 
rameters depends in general on the specific parametrisation 
of the estimated model, as discussed in 32.21 



3.3 Simulations 

In order to check the validity of the expansion described in 
!|2] and Appendix lAJ and gain insights in the origin of the 
bias, we performed numerical simulations of this toy model. 
We considered a range of SNR for a circular Gaussian galaxy 
of true size a* = 4 convolved with a circular Gaussian PSF 
of size p = 5.33 pixels. This corresponds to a rat io of the 
convolved galaxy size to the PSF size of i/a'^ -|- p^/p ~ 1.25 
which is typically used as the limit for weak lensing surveys. 
Even for this very simple one-parameter toy model, we 
find that great care must be taken for the implementation 
of the minimization of the x^ function. Readily available 



0.4 



0.3 



Oh 



0.2 t 



0.1 




0.0 

4 6 

a (pixels) 

Figure 1. Distribution P{a) of the MLE estimator for the size a 
of the 2D Gaussian from repeated realisations. The curved from 
broad (thin lines) to sharp (thick lines) correspond to SNR p[fo] 
of 3, 5, 7, 9, 12, 17, 25, 40, respectively. The true value of the 
parameter a* = 4 pixels is shown as the vertical dashed line. The 
PSF size is p = 5.33 pixels corresponding to a convolved galaxy 
size to the PSF size of ^a" +p^/p ~ 1.25. 



minimiser algorithms produced artifacts that appear to de- 
pend on the choice of algorithm at the high level of preci- 
sion required for weak lensing. For the present simulations, 
we computed x^i^-) on a grid in the interval 0.1 < a < 10 
pixels with a grid size of Aa = 0.0033 pixels and found the 
minimum by direct search. 

Figure IT] shows the resulting Probability Distribution 
Function (PDF) P{a) of the estimator for a for a range of 
SNR p[fo]. At high SNR, the PDF is nearly Gaussian and 
peaks close to the true value a' = 4 pixels. As the SNR de- 
creases, the PDF main peak shifts towards the right, while a 
secondary peak at a = starts developing. The complicated 
combination of these two effects contribute to the depen- 
dence of the bias and rms error on SNR. 

Figure PI shows the dependence of the rms variance a[a] 
on the SNR p[fo] ■ We see that our expression in Equationis] 
is a good approximation for p[/o] ^ 10. The deviations below 
this value are not surprising since higher order terms are 
expected to become important in the low SNR limit. 

Figure [3] shows the dependence of the bias b[a] on the 
SNR p[fo]- Again, our expression in Equationlslis a good ap- 
proximation for SNR > 10, with deviations below this value 
likely due to higher order terms. The horizontal, dashed line 
in the lower panel corresponds to the requirement for fut ure 
all sky surveys b[a]/a ~ (57 ~ 3 x 10~* as discussed in f2.2 



and in Amara & Refregier ( |2007[ ) . The bias for galaxies with 
SNR ~ 10 in this toy model is b[a]/a ~ 0.015, which is nearly 
two order of magnitude greater than this requirement and 
comparable to the expected weak lensing signal Sy ~ 0.02. 
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Figure 2. Standard deviation a[a\ of the size estimator a as 
a function of signal-to-noise ratio SNR = p[/o]- The expectation 
from the analytical prediction (solid line) is compared to the mea- 
surements from repeated experiments. 



4 ELLIPTICAL GAUSSIAN MODEL 

We now consider a slightly more realistic model of a galaxy 
consisting of a 2D elliptical Gaussian galaxy with 6 param- 
eters. This is the smallest number of parameters needed to 
measure galaxy shapes for weak lensing in practice as they 
correspond to the 2 centroid coordinates, flux, major and 
minor axes and position angle of the galaxy. In practice the 
models for galaxies are typically non-Gaussian and more 
complicated in order to better describe realistic galaxies. 
The 6-parameter Gaussian model is nevertheless useful to 
study the behaviour of the bias in the multi-parameter case. 
Let us thus consider a galaxy surface brightness given 
by a 2D elliptical Gaussian (without PSF convolution) that 
we parametrise as (see also |Paulin-Henriksson et al.|2008| for 
a slightly different parametrisation) : 



/(x;a) 



A 



2Tv^aia2 



exp 



--(X 
2^ 



x")^A-^(x- 



(18) 



where ai and a2 are the (rms) major and minor axes of the 
Gaussian respectively, A is a parameter which determines 
the amplitude, x" is the centroid, and "^ denotes the trans- 
pose operator. The quadrupole moment matrix A is a 2 x 2 
symmetric matrix which can be written as 



R(a) 







R(a) 



(19) 



where a is the position angle of the major axis counter- 
clockwise from the a;-axis and 



R(a) = 



cos a 
— sin a 



smof 
cos a 



(20) 



is the rotation matrix which aligns the coordinate system 
with the major axis. In Appendix [Bl we show that the am- 
plitude, centroid, and quadrupole moment matrix are simply 




0.001 r 



Figure 3. Bias b[a] of the size estimator a as a function of SNR = 
p[fo] in linear (upper panel) and log (lower panel) axes. In both 
panels, the solid line correspond to the analytical model while the 
squares were derived from simulations of repeated experiments. 
The horizontal dashed line in the bottom panel corresponds to 
the requirement for future all sky surveys (see text). 



related to the multipole moments of galaxy surface bright- 
ness. 

Let us consider the MLEs for this 2-dimensional Gaus- 
sian with the following 6 free parameters 



{xi,X2,A,ai,a2,a) . 



(21) 



In the continuum limit (Eq. A8l, the Fisher matrix equa- 
tions Q can be computed analytically. This is facilitated by 
rotating into the coordinate system aligned with the major 
and minor axes of the galaxy, before performing the inte- 
gral of the surface of the galaxy. The resulting Fisher ma- 
trix Fij and covariance matrix cov[ai, Ui] of the parameters 
are given in Equation |B4| in Appendix [B] We note that, 
with this parametrisation, the Fisher matrix is conveniently 
nearly diagonarj The corresponding rms statistical errors 



^ Note that in the parametrisation of [Paulin-Henriksson et al.| 
l |2008[ l the Fisher matrix is also not quite diagonal, due to covari- 
ance terms between their rotated centroid and the position angle 
neglected in this paper. 
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a[ai\ = cov[ai, a,;]^" are listed in the third column in the 
top part of Table [l] to leading order in the SNR of the am- 
plitude A, defined as 



P[A] 



A 

^A] 



A 



(22) 



/A-KhOn 

Using Equation (|5| after the same change of coordinates 
and some cumbersome algebra, we can also derive the bias 
in the model parameters which are given in Equation |B5| 
and listed in the last column of Table [l] Note that several 
of the parameters have a singularity as the galaxy becomes 
circular, i.e. when e = 0, which occurs at a^ — 02- This 
follows from the fact that, in this limit, the position angle a 
becomes degenerate. 

From these expressions, and using Equations ( 7|8 l, we 
can derive the error and bias for derived quantities. The 
lower part of Table IT] provides the definition, statistical er- 
rors and biases for several common ly u sed parameters such 
as the flux F^^' (see also Equation Bll, the average radius 



V^ 



a = -i/ of -|- flj and two definitions of the ellipticities e and 
e. All these parameters are biased to second order in p[A]. 
Interestingly, the average radius a is biased in the elliptical 
case even in the absence of a PSF convolution, as a result 
of covariances with other parameters. Also, it is interesting 
to note that the rms error and bias of oi and 02 have a 
singularity in the circular case, but that the singularity can- 
cels when they are combined to form the mean radius a. We 
can also verify that the scaling of Equation [9] for the vari- 
ance and bias of the parameters holds up to multiplicative 
factors of order unity. 



5 CONCLUSIONS 

In this paper we have studied the effect of noise bias on 
MLEs for weak lensing shape parameters in the presence 
of additive Gaussian noise. We have derived general expres- 
sions for the covariance and bias for ML-estimated param- 
eters of 2D galaxy images, which are given by Equations [3] 
and|5] The bias vanishes for linear models, but is generally 
non-zero for models which are non-linear in the parameters 
and depend on the model parametrisation. To illustrate the 
effect of the noise bias we have calculated analytical expres- 
sions for the variance and bias for a toy model consisting of a 
2D circular Gaussian galaxy, convolved with a circular Gaus- 
sian PSF, with the galaxy size as a single free parameter. 
We have compared these predictions with careful numerical 
simulations and found them to be in good agreement. We 
also provide analytical results for a 2D elliptical Gaussian 
with 6-parameters. 

We find that the variance and bias of the parameters are 
generically of order p~^, where p is the SNR of the galaxies. 
For galaxies with p ~ 10, which is typical of weak lens- 
ing surveys, this implies a bias in the parameters of order 
p~^ ~ 0.01. This is comparable to the weak lensing shear 
signal 7 ~ 0.02 and nearly two orders of magnitude greater 
than the systematic shear tolerance required for future all 
sky surveys, ^7 ~ 3 x 10"''. Although derived using the spe- 
cific case of the MLE in the presence of Gaussian noise, our 
results are likely to be generic across a number of measure- 
ment techniques. This may contribute towards explaining 
why current weak lensing surveys are limited by systemat- 



ics, and why finding sufficiently accurate methods has been 
difficult. 

To solve this problem, the following ways forward are 
possible 

• Use higher SNR galaxies. This is an obvious solution 
but it is costly in practice as it leads to a sharp drop in the 
useful surface density of galaxies, or requires longer exposure 
times and/or larger telescopes. 

• Avoid non-linearities, either by choosing linear models 
or using moment-based methods. This was the idea behind 
shapelets methods, but we note that in all cases some level 
of non-linearity is unavoidable as the centroid and size (of 
the basis functions or weight function) are intrinsically non- 
linear. 

• Go beyond ML estimation by using, e.g., Bayesian 
methods or other averaging techniques. For instance, the 
introduction of stacking methods was a breakthrough in the 
GREAT08 challenge ( [Bridle et al.|2008| |. 

• Calibrate the bias. While order-by-order correction 
methods exist for the MLEs, the models used in practice 
to model galaxies (e.g., exponential, de Vaucouleur, Ser- 
sic, or a multi-component combination) convolved with ob- 
served PSFs will be complex and thus not offer analytical 
expressions for the bias. Instead, numerical simulations will 
be needed for the bias calibration. This is the approach 
described in the accompanying paper by Kacprzak et al. 
(2012). 
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Table 1. Statistical errors and biases of parameters for 6-parameter elliptical Gaussian model. 
Errors and biases are shown to leading order in p 
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Derived Parameters 



Flux F° = Ay/a^a^ \f2 

Radius mean a = \J a\ + a^ VT-pe^ 
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APPENDIX A: BIAS FOR GENERAL MLE 

In this Appendix, we provide the derivation of the main re- 
sults for the variance and bias of a general MLE of param- 
eters given in 32] for additive, uncorrelated Gaussian noise. 
For the model describe in !J2] , the likelihood is 

L oc e"'^'/^ (Al) 

where 

x'(a) = ^<T-2 [/(xp;a*) +n(xp) - ,/(xp;a)]'. (A2) 

V 

The MLE a is then defined as the value of the parameters 
a which maximises the likelihood L or, equivalently, which 
minimises x^ 



da 



= 0. 



(A3) 



To proceed, we expand this expression in terms of the 
inverse of the SNR, p ~ f/n, of the object. This can be 
conveniently done by rewritting n(xp) — > an{xp) and a = 
a* +a5sS^' +a'^5sL^^' -I- ■ ■ ■, where a is a dimensionless order 
parameter which scales as a ~ p~^. We then Taylor ex pand 
f{x,a) about a 
and set a 



collect like powers of a in Equation A3 
1. The terms of order a yield 



Sa'^^ = (F-%, 



V 



2 / 

n(Xy 



^ '9/(xp;a) 
9a, 



(A4) 



where the fisher matrix Fij was defined in Equation |4] Tak- 
ing the average of this expression and using the fact that the 
noise is unbiased, i.e. {n(xp)) = 0, we see that the estimator 
is unbiased to this order. Taking the average of the product 



{5a\ SaS''') and using the fact that the noise is uncorre- 



lated, i.e. (n(xp)n(xp')) = 5p,p'CT„, we obtain the expression 
for the covariance of the parameters to leading order given 
in Equation [3] 

The terms of order a^ yield 



5a?^ = 



-AF U{F-')kiBiki, 



(A5) 
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and thus gives Equation [5] for the bias to leading order, 
where the bias tensor was defined in Equation [6J 

These results can also be derived from the general ex- 
pressions of the Fisher Matrix 



yinL 
daidai 



and for the bias tensor for the MLE 

, 1 9^1nL dlnLd'^lnL 



2 daidaidak 



daj daidak 



(A6) 



(A7) 



In the limit of small pixels, the sum over pixels in the 
expressions above can be approximated by a continuous in- 
tegral 

■ 0, (A8) 



p 



h^ 



as h 



where h is the pixel size. 



The bias in the parameters can be derived using Equa- 
tion [S] which yields 

b[ai 
a 

where the quadratic ellipticity e is defined in Table IT]. 

The resulting rms errors and biases for the model pa- 
rameters are summarised in the upper part of Table IT] 



^ = {0,0,l,e~\-e-',0)p[A]-' + O{p- 



(B5) 



APPENDIX B: RESULTS FOR THE 
ELLIPTICAL GAUSSIAN MODEL 

In this Appendix, we provide results for the elliptical Gaus- 
sian model defined in Equation [18] as a function of the 6 
parameters listed in Equation |21| in the presence of addi- 
tive, uncorrelated Gaussian noise as defined in Equation [Tj 
With this parametrisation, the flux, or zeroth order mo- 
ment of the Gaussian is given by 



F 



(0) 



d X /(x;a) = Ay/a\a2. 



The centroid, or first order moments are given by 



(1) 



F(0) 



1 



d X Xif{x.; a) — x1, 



(Bl) 



(B2) 



and the quadrupole moment matrix, containing the second 
order moments, is given by 

p{2) 



F'- 



1 



d X XiXjf{x.;a) — Ai 



(B3) 



For this model, in the continuum limit (Eq. AS I, the 
Fisher Matrix Fij (Eq. |4| can be derived analytically by ro- 
tating into a coordinate system with axes parallel to the ma- 
jor and minor axes of the galaxy (using the rotation matrix 



in Eq. 20 1 before performing the integrals over the surface 



of the galaxy. The covariance matrix of the parameters is 
then obtained by inverting the Fisher matrix (Eq. ^ which 
yields 

/ 



2Aii 


2Ai2 














2Ai2 


2A22 




















A' 




















2ai 
























2ai 






(aj-aly^ 



COV Oi, Oj 



V 

xp[A]-^ + 0{p-'), (B4) 

where Aij are the components of the quadrupole moment 
matrix A defined in Equation |19| and p[A] is the SNR of 
the amplitude A given in Equation |22[ 
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